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Abstract 



We introduce a constraint system called FT. This system offers a theoretical and practical 
alternative to the usual Herbrand system of constraints over constructor trees. Like Herbrand, 
FT provides a universal data structure based on trees. However, the trees of FT (called feature 
trees) are more general than the constructor trees of Herbrand, and the constraints of FT are 
of finer grain and of different expressiveness. The essential novelty of FT is provided by 
functional attributes called features which allow representing data as extensible records, a 
more flexible way than that offered by Herbrand's fixed arity constructors. The feature tree 
structure determines an algebraic semantics for FT. We establish a logical semantics thanks to 
three axiom schemes presenting the first-order theory FT. We propose using FT as a constraint 
system for logic programming. We provide a test for constraint unsatisfiability, and a test 
for constraint entailment. The former corresponds to unification and the latter to matching. 
The combination of the two is needed for advanced control mechanisms. We use the concept 
of relative simplification of constraints, a normalization process that decides entailment and 
unsatisfiability simultaneously. The two major technical contributions of this work are: (1) an 
incremental system performing relative simplification for FT that we prove to be sound and 
complete; and (2) a proof showing that FT satisfies independence of negative constraints, the 
property that conjoined negative constraints may be handled independently. 

Resume 

Nous presentons un systeme de contraintes appele FT. Ce systeme constitue une alternative 
theorique et pratique a Herbrand, le systeme usuel de contraintes sur les arbres a constructeurs. 
Comme Herbrand, FT fournit une structure de donnees d' arbres. Cependant, les arbres de FT 
(appeles arbres a traits) sont plus generaux que les arbres a constructeurs de Herbrand, et les 
contraintes de FT sont d'une granularite plus fine et d'expressivite differente. L'innovation 
essentielle de FT est due a des attributs fonctionnels appeles traits qui permettent de representer 
les donnees sous forme de structure d'enregistrement extensible, de maniere plus flexible que 
celle offerte par les constructeurs d'arite fixe de Herbrand. La structure d'arbre a traits 
determine une semantique algebrique pour FT. Nous etablissons une semantique logique grace 
a trois schemas d'axiomes presentant la theorie du premier ordre FT. Nous proposons d'utiliser 
FT comme un systeme de contraintes pour la programmation logique. Nous produisons un 
critere de satisfaisabilite de contrainte, et un critere de validation d'implication de contrainte. 
Le premier correspond a l'unincation et le deuxieme au filtrage. La combinaison des deux 
est necessaire pour des mecanismes de controle avances. Nous utilisons le concept de 
simplification relative, un processus de normalisation qui decide simultanement la validation 
d'implication et la non-satisfaisabilite. Les deux contributions techniques majeures de ce 
travail sont : (1) un systeme incremental effectuant la simplification relative pourFJ, que nous 
demontrons etre coherent et complet; et (2) une preuve montrant que FT jouit de 1' independance 
des contraintes negatives, propriete qui permet a des contraintes negatives conjointes d'etre 
traitees separement. 
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1 Introduction 

An important structural property of many logic programming systems is the fact that they 
factorize into a constraint system and a relational facility. Colmerauer's Prolog II [10] 
is an early language design making explicit use of this property. CLP (Constraint Logic 
Programming [12]), ALPS [18], CCP (Concurrent Constraint Programming [23]), and KAP 
(Kernel Andorra Prolog [11]) are recent logic programming frameworks that exploit this 
property to its full extent by being parameterized with respect to an abstract class of constraint 
systems. The basic operation these frameworks require of a constraint system is a test for 
unsatisfiability. In addition, ALPS, CCP, and KAP require a test for entailment between 
constraints, which is needed for advanced control mechanisms such as delaying, coroutining, 
synchronization, committed choice, and deep constraint propagation. LIFE [5, 6], formally 
a CLP language, employs a related, but limited, suspension strategy to enforce deterministic 
functional application. Given this situation, constraint systems are a central issue in research 
on logic programming. 

The constraint systems of most existing logic programming languages are variations and 
extensions of Herbrand [16], the constraint system underlying Prolog. The individuals of 
Herbrand are trees corresponding to ground terms, and the atomic constraints are equations 
between terms. Seen from the perspective of programming, Herbrand provides a universal 
data structure as a logical system. 

This paper presents a constraint system FT, which we feel is an intriguing alternative to 
Herbrand both theoretically and practically. Like Herbrand, FT provides a universal data 
structure based on trees. However, the trees of FT (called feature trees) are more general than 
the trees of Herbrand (called constructor trees), and the constraints of FT are of a finer grain and 
of different expressiveness. The essential novelty of FT is due to functional attributes called 
features, which provide for record-like descriptions of data avoiding the overspecification 
intrinsic in Herbrand's constructor-based descriptions. For the special case of constructor 
trees, features amount to argument selectors for constructors. 

Constructor trees are useful for structuring data in modern symbolic programming languages; 
e.g., Prolog and ML. This gives the more flexible feature trees an interesting potential. More 
precisely, feature trees model extensible record structures. They form the semantics of record 
calculi like [1], which are used in symbolic programming languages [5] and in computational 
linguistics (for example, see [3, 24] and the book [8]). Generally, these extensible record 
structures allow hierarchical representation of partial knowledge. They lend themselves to 
object-oriented programming techniques [3]. 

Let us suppose that we want to say that x is a wine whose grape is riesling and whose 
color is white. To do this in Herbrand, one may write the equation: 



with the implicit assumption that the first argument of the constructor wine carries the 
"feature" grape, the second argument carries the "feature" color, and the remaining 



x = wine(riesling, white, yi , ... , 



Yn) 
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wine 




color 
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riesling 



grape / \ \ year 
/ color\ 

riesling white 1988 




wine 




white 



color 



origin 



alsace 



Figure 1: Examples of Feature Trees. 



arguments yi, .. . , y n carry the remaining "features" of the chosen representation of wines. 
The obvious difficulty with this description is that it says more than we want to say; namely, 
that the constructor wine has n + 2 arguments and that the "features" grape and color are 
represented as the first and the second argument. 

The constraint system FT avoids this overspecification by allowing the description 



saying that x has sort wine, its feature grape is riesling, and its feature color is white. 
Nothing is said about other features of x, which may or may not exist. 

The individuals of FT are feature trees. A feature tree is a possibly infinite tree whose nodes are 
labeled with symbols called sorts, and whose edges are labeled with symbols called features. 
The labeling with features is deterministic in that all edges departing from a node must be 
labeled with distinct features. Thus, every direct subtree of a feature tree can be identified by 
the feature labeling the edge leading to it. The constructor trees of Herbrand can be represented 
as feature trees whose edges are labeled with natural numbers indicating the corresponding 
argument positions. 

Examples of feature trees are shown in Figure 1 . All but the second and third feature tree in 
Figure 1 satisfy the description (1). 

The constraints of FT are ordinary first-order formulae taken over a signature that accommo- 
dates sorts as unary predicates and features as binary predicates. Thus the description (1) is 
actually syntactic sugar for the formula: 



x: wine[grape riesling, color white] 



(1) 



wine(x) A 3y (grape(x,y) A riesling(y)) 
A 3y (color(x,y) A white(y)). 
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The set of all rational feature trees is made into a corresponding logical structure T by letting 
A(x) hold if and only if the root of x is labeled with the sort A, and letting/(x, y) hold if and 
only if x has y as direct subtree via the feature/. The feature tree structure T fixes an algebraic 
semantics for FT. 

We will also establish a logical semantics, which is given by three axiom schemes fixing a 
first-order theory FT. Backofen and Smolka [7] show that T is a model of FT and that FT is in 
fact a complete theory, which means that FT is exactly the theory induced by T. However, we 
will not use the completeness result in the present paper, but show explicitly that entailment 
with respect to T is the same as entailment with respect to FT. 

The two major technical contributions of this paper are (1) an incremental simplification 
system for entailment that is proven to be sound and complete, and (2) a proof showing that 
the "independence of negative constraints" property [9, 16, 17] holds for FT. 

The incremental entailment simplification system is the prerequisite for FTs use with either of 
the constraint programming frameworks ALPS, CCP, KAP or LIFE mentioned at the beginning 
of this section. Roughly, these systems are concurrent thanks to a new effective discipline 
for procedure parameter-passing that we could describe as "call-by-constraint-entailment" (as 
opposed to Prolog's call-by-unification). 

The independence property is important since it means that negative constraints on feature 
trees can be solved (exactly like in Colmerauer's work on disequations over infinite trees [9]). 
Namely, thanks to independence, a conjunction with more than one negated constraints 
<j> A A ... A -i0„ can be solved by testing separately each negated constraint fa for 
entailment, for i = l,...,n. This, of course, is done by our simplification system for 
entailment. 

One origin of FT is Ai't-Kaci's ^-term calculus [1], which is at the heart of the programming 
language LOGIN [3] and further extended in the language LIFE [5] with functions over feature 
structures thanks to a generalization of the concept of residuation of Le Fun [4]. 1 Other 
precursors of FT are the feature descriptions found in unification grammars [15, 14] developed 
for natural language processing, and also the formalisms of Mukai [19, 20] (for a thorough 
survey of precursors in this field, cf., [8]). These early feature structure formalisms were 
presented in a non logical form. Major steps in the process of their understanding and logical 
reformulation are the articles [22, 25, 13, 24]. Feature trees, the feature tree structure T, and 
the axiomatization of T were first given in [7]. The technique of relative simplification of 
constraints was first introduced and used in [6] to explain the behavior of functions as passive 
constraints in LIFE. 

The paper is organized as follows. Section 2 defines the basic notions and discusses the 
differences in expressivity between Herbrand and FT. Section 3 gives a basic simplification 
system that decides satisfiability of positive constraints. The material of Section 4 is not limited 

: Le Fun [4] is an extension of Prolog seen as a constraint logic programming system over Herbrand terms 
extended with applicative expressions. Le Fun's constraint solver achieves implicit coroutining thanks to an 
automatic suspension mechanism called "residuation" delaying equations with unsufficiently instantiated function 
arguments. Resumption is triggered asynchronously by function argument matching. 
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to FT but discusses the notion of incremental entailment checking and its connection with the 
independence property and negation. Section 5 gives the entailment simplification system, 
proves it sound, complete and terminating, and also proves that FT satisfies the independence 
property. 

2 Feature Trees and Constraints 

To give a rigorous formalization of feature trees, we first fix two disjoint alphabets S and T , 
whose symbols are called sorts and features, respectively. The letters A, B, C will always 
denote sorts, and the letters/, g, h will always denote features. Words over T are called paths. 
The concatenation of two paths v and w results in the path vw. The symbol e denotes the 
empty path, ve = ev = v, and J 7 * denotes the set of all paths. 

A tree domain is a nonempty set D C J 7 * that is prefix-closed; that is, if vw G D, then v G D. 
Thus, it always contains the empty path. 

A feature tree is a mapping t : D — > S from a tree domain D into the set of sorts. The paths in 
the domain of a feature tree represent the nodes of the tree; the empty path represents its root. 
The letters s and t are used to denote feature trees. 

When convenient, we may consider a feature tree fas a relation, i.e., t C T* X S, and write 
(w, A) G t instead of t(w) = A. (Clearly, a relation t C T* X S is a feature tree if and only 
if D = {w | 3A: (w,A) G t} is a tree domain and t is functional; that is, if (w,A) G t and 
(w, B) G t, then A = B.) As relations, i.e., as subsets of T* X S, feature trees are partially 
ordered by set inclusion. We say that s is smaller than (or, is a prefix-subtree of; or, subsumes; 
or, approximates) t iis C t. 

The subtree wt of a feature tree t at one of its nodes w is the feature tree defined by (as a 
relation): 

wt := {(v,A) | (wv,A) G t}. 

If D is the domain of t, then the domain of wt is the set w~ l D = {v \ wv G D}. Thus, wt is 
given as the mapping wt : w -1 D — > 5 defined on its domain by Htf(v) = ?(wv). A feature tree 
s is called a subtree of a feature tree t if it is a subtree s = wt at one of its nodes w, and a direct 
subtree ifweJ 7 . 

A feature tree t with domain D is called rational if (1) t has only finitely many subtrees and 
(2) t is finitely branching; that is: for every w G D, wT H D = {wf G D \ f G J 7 } is finite. 
Assuming (1), the condition (2) is equivalent to saying that there exist finitely many features 
fu---Jn such that D C {f u ... ,/„}*. 

Constraints over feature trees will be defined as first-order formulae. We first fix a first-order 
signature S W T by taking sorts as unary and features as binary relation symbols. Moreover, 
we fix an infinite alphabet of variables and adopt the convention that x, y, z always denote 
variables. Under this signature, every term is a variable and an atomic formula is either 
a feature constraint xfy (f(x, y) in standard notation), a sort constraint Ax (A(x) in standard 



November 1992 



Digital PRL 



A Feature Constraint System 



5 



notation), an equation x = y, _L ("false"), or T ("true"). Compound formulae are obtained as 
usual by the connectives A, V, — >, <->, -i and the quantifiers 3 and V. We use 3(j> and to 
denote the existential and universal closure of a formula cj>, respectively. Moreover, V{(j>) is 
taken to denote the set of all variables that occur free in a formula cj>. The letters (j> and if> will 
always denote formulae. In the following we will not make a distinction between formulae 
and constraints; that is, a constraint is a formula as defined above. 

S W T -structures and validity of formulae in S W ^"-structures are defined as usual. Since we 
consider only S W ^"-structures in the following, we will simply speak of structures. A theory 
is a set of closed formulae. A model of a theory is a structure that satisfies every formula of 
the theory. A formula (f> is a consequence of a theory T (T |= <j>) if holds in every model 
of T. A formula <j> is satisfiable in a structure A if 3(j) holds in A. Two formulae <j>, ip are 
equivalent in a structure A if V(^> <-> ip) holds in A. We say that a formula (f> entails a formula 
ip in a structure A [theory 7] and write <j> \=a ^> [(j) \=t ip] if V(^» — > V) holds in A; i.e., 
A |= V(^> — > V) [is a consequence of T; i.e., FT |= V(^» — > A theory T is complete if for 
every closed formula ^» either ^» or -i^ is a consequence of T. 

The feature tree structure T is the 5 W ^"-structure defined as follows: 

• the domain of T is the set of all rational feature trees; 

• t G A T if and only if t(e) = A (fs root is labeled with A); 

• ( s : t) G f r if and only iff G D s and t = fs (t is the subtree of s at/). 

Roughly, the Herbrand constraint y = A(x\,X2), where A is a binary constructor symbol, and 
the feature constraint Ay A y\x\ Ay2xi, where A is a sort and 1 , 2, ... are features, correspond to 
each other. (We will see later that this correspondance is a formal one for satisfiability, but nor 
for entailment.) Now it becomes clear what we mean by saying that feature constraints are finer 
grained. Also, feature trees are more general in the sense that they satisfy more constraints. 
For example, no constructor tree y satisfies bothv = A(x\,xi) andv = A{x\,X2,xz). 

Next we discuss the expressivity of our constraints with respect to feature trees (that is, with 
respect to the feature tree structure T) by means of examples. The constraint: 

^y(xfy) 

says that x has no subtree at/; that is, that there is no edge departing from x's root that is 
labeled with/. To say that x has subtree y at path/i • • •/„, we can use the constraint: 

3zi ■ ■ ■ 3z n -\{xfiZ\ A ZlflZl A ... A Zn-lfny)- 

Now let us look at statements we cannot express. One simple unexpressible statement is "v is 
a subtree of x" (that is, "3w: y = wx"). Moreover, we cannot express that x is smaller than 
y. Finally, if we assume that the alphabet T of features is infinite, we cannot say that x has 
subtrees at features /i , . . . ,/„ but no subtree at any other feature. In particular, we then cannot 
say that x is a primitive feature tree; that is, has no proper subtree. 

The theory FTq is given by the following two axiom schemes: 
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(Axl ) Vx My Mz (xfy Axfz — > y = z) (for every feature/) 
(Ax2) Vx (Ax A Bx — > _L) (for every two distinct sorts A and B). 

The first axiom scheme says that features are functional and the second scheme says that 
sorts are mutually disjoint. Clearly, T is a model of FTq. Moreover, FTq is incomplete (for 
instance, 3x(Ax) holds in T but not in other models of FT®). We will see in the next section 
that FTq plays an important role with respect to basic constraint simplification. 

Next we introduce some additional notation needed in the rest of the paper. This notation will 
also allow us to state a third axiom scheme that, as shown in [7], extends FTq to a complete 
axiomatization of T. 

Throughout the paper we assume that conjunction of formulae is an associative and commu- 
tative operator that has T as neutral element. This means that we identify (j> A (if) A 6) with 
6 A (ip A (f>), and (f> A T with (f> (but not, for example, xfy A xfy with xfy). A conjunction of 
atomic formulae can thus be seen as the finite multiset of these formulae, where conjunction is 
multiset union, and T (the "empty conjunction") is the empty multiset. We will write tp C (f> 
(or ip G (j), if ip is an atomic formula) if there exists a formula ip' such that ip A ip' = (j). 

We will use an additional atomic formula x/| ("f undefined onx") that is taken to be equivalent 
to -i3y (xfy), for some variable y (other than x). 

Only for the formulation of the third axiom we introduce the notion of a solved-clause, which 
is either T or a conjunction (f> of atomic formulae of the form xfy, Ax or xf] such that the 
following conditions are satisfied: 

1 . if Ax G (j> and Bx G (j>, then A = B; 

2. if xfy G (j) and xfz G (j), then y = z; 

3. if xfy G (j), thenx/t ^ (j). 

Given a solved-clause (j), we say that a variable x is dependent in (j) if (j) contains a constraint 
of the form Ax, xfy or xf\, and use W(</>) to denote the set of all variables that are dependent 
in (j). 

The theory FT is obtained from FTq by adding the axiom scheme: 
(Ax3) V 3X(j) (for every solved-clause <j> and X = W((j))). 

Theorem 1 The feature tree structure T is a model of the theory FT. 

Proof: We will only show that T is a model of the third axiom. Let X be the set of dependent 
variables of the solved-clause (j>, X = VV((j)). Let a be any T-valuation defined on V((j)) — X; we 
write the tree a(y) as t y . We will extend a on X such that T, a \= cj>. 
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Given x £ X, we define the "punctual" tree t x = {(e, A)}, where A £ <S is the sort such that Ax £ <j>, 
if it exists, and arbitrary, otherwise. Now we are going to use the notion of tree sum of Nivat [21], 
where w~ l t = {(wv, A) | (v, A) £ f} ("the tree t translated by w"), and we define: 

a(x) = \^j{w~ 1 t y | x-^ y for some y £ V(<£), w G J 7 *}. 

Here the relation is given by: x x, and x^Z> yif x-^> y' and //y £ ^, for some y' £ V(<fr) and 
some / £ T . Since: 

a(x) = \J{ W - 1 a(y)\ ...} 
and, for a node w of a(x), wa(x) = a(y), it follows that a(x) is a rational tree and that T, a \= cj>. | 

For another proof of this theorem see [7], which also proves that FT is a complete theory if the 
alphabets of sorts and features are infinite. 

A practical motivation for the assumption on the infiniteness of T (and of S as well) is the 
need to account for dynamic record field updates. It turns out that this semantical point of 
view has advantages in efficiency as well. Thus, the algorithms we present in this paper for 
entailment and for solving negative constraints on feature trees rely on the infiniteness of T 
and S. 

3 Basic Simplification 

A basic constraint is either _L or a possibly empty conjunction of atomic formulae of the form 
Ax, xfy, and x = y. The following five basic simplification rules constitute a simplification 
system for basic constraints, which, as we will see, decides whether a basic constraint is 
satisfiable in T. 

xfyAxfzAt 
xfz Ay = z A (f> 

„ Ax A Bx A <b 

2. A + B 

_L r 

Ax A Ax A <f> 
Ax A <f> 

4. . X = ^ - - x G V((j>) andx ^ y 
x = y A </)[x <- y] yv> Ty 

x = x A (f> 
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The notation (j)[x <— y] is used to denote the formula that is obtained from <j> by replacing 
every occurrence of x with y. We say that a constraint (p simplifies to a constraint ip by a 
simplification rule p if ^ is an instance of p. We say that a constraint (p simplifies to a constraint 
ip if either (p = ip or (p simplifies to ip in finitely many steps each licensed by one of the five 
simplification rules given above. 

Example 3.1 In order to check whether the two feature descriptions x\f =>• u : A] and 
y\f =>• v : A] are unifiable, in the sense of [3], we will simplify the basic constraint 

xfu A yfv A Au A Av A z = x A y = z. 

The following basic simplification chain, leads to a solved constraint (which, as shown 
in [24, 5], exhibits unifiability): 





xfu A yfv A Au A Av A z = x A y = z 


by Rule 4 = 


> xfu A yfv A Au A Av A z = x A y = x 


by Rule 4 = 


> xfu A xfv A Au A Av A z = x A y = x 


by Rule 1 = 


> xfvAAuAAvAu = vAz = xAy = x 


by Rule 4 = 


> xfvAAvAAvAu = vAz = xAy = x 


by Rule 3 = 


> xfvAAvAu = vAz = xAy = x 



Using the same steps up to the last one, the constraint xfu A yfv A Au A Bv A z = x A y = z 
simplifies to _L (in the last step, Rule 2 instead of Rule 3 is applied). 

Proposition 1 If the basic constraint (f> simplifies to ip, then FT® |= (f> <^> tp. 

Proof: The rules 3, 4 and 5 perform equivalence transformations with respect to every structure. 
The rules 1 and 2 correspond exactly to the two axiom schemes of FTq and perform equivalence 
transformations with respect to every model of FT 0 . | 

We say that a basic constraint <j> binds a variable xtoyifx = yG^» and x occurs only once in 
(p. At this point it is important to note that we consider equations as ordered; that is, assume 
that x = y is different from y = x if x ^ y. We say that a variable x is eliminated, or bound by 
(p, if (p binds x to some variable y. 

Proposition 2 The basic simplification rules are terminating. 

Proof: First observe that the simplification rules do not add new variables and preserve eliminated 
variables. Furthermore, Rule 4 increases the number of eliminated variables by one. Hence we know 
that if an infinite simplification chain exists, we can assume without loss of generality that it only 
employs the Rules 1, 3 and 5. Since Rule 1 decreases the number of feature constraints "xfy", which 
is not increased by Rules 3 and 5, we know that if an infinite simplification chain exists, we can 
assume without loss of generality that it only employs Rules 3 and 5. Since this is clearly impossible, 
an infinite simplification chain cannot exist. | 
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A basic constraint is called normal if none of the five simplification rules applies to it. A 
constraint ip is called a normal form of a basic constraint (p if (p can be simplified to ip and ip is 
normal. A solved constraint is a normal constraint that is different from _L. 

So far we know that we can compute for any basic constraint (p a normal form ip by applying 
the simplification rules as long as they are applicable. Although the normal form ip may not 
be unique for <j>, we know that <j> and ip are equivalent in every model of FTq. It remains to 
show that every solved constraint is satisfiable in T. 

Every basic constraint <j> has a unique decomposition <j> = fa A fa such that fa is a possibly 
empty conjunction of equations "x = y" and and (po is a possibly empty conjunction of feature 
constraints "xfy" and sort constraints "Ax". We call fa the normalizer and and (pc the graph 
of </>. 

Proposition 3 A basic constraint (p ^ _L is solved if and only if the following conditions hold: 

1. an equation x = y appears in (p only if x is eliminated in (p; 

2. the graph of (p is a solved clause; 

3. no primitive constraint appears more than once in (p. 

Proposition 4 Every solved constraint is satisfiable in every model of FT. 

Proof: Let ^ be a solved constraint and A be a model of FT. Then we know by axiom scheme Ax3 
that the graph <pa of a solved constraint <p is satisfiable in an fT-model A. A variable valuation a 
into A such that A, a \= 4>c can be extended on all eliminated variables simply by a(x) = a(y) if 
x = y £ <f>, such that A, a \= cj>. | 

The following theorem states that basic simplification yields a decision procedure for satisfia- 
bility of basic constraints. 

Theorem 2 Let ip be a normal form of a basic constraint (j). Then (j) is satisfiable in T if and 
only if tp ^ _L. 

Proof: Since cj> and ip are equivalent in every model of FT a and T is a model of FTq, it suffices 
to show that ip is satisfiable in T if and only if ip ^ _L. To show the nontrivial direction, suppose 
ip ^ _L Then ip is solved and we know by the preceding proposition that ip is satisfiable in every 
model of FT. Since T is a model of FT, we know that ip is satisfiable in T. | 

The next theorem implies the elementary equivalence of all models of FT with respect to 
satisfiability of basic constraints. Namely, satisfiability in any of the models of FT means 
satisfiability in all of them. Also, it is sufficient to test satisfiability in the model T alone. 
Finally, only the first two axioms are relevant for satisfiability. 
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Theorem 3 For every basic constraint <j> the following statements are equivalent: 
T |= 3(j> o 3 model A of FT 0 : A |= 3(j> O FT |= 3(j>. 

Proof: The implication 1 =>■ 2 holds since T is a model of FTq. The implication 3 =>■ 1 follows from 
the fact that T is a model of FT. It remains to show that 2 =>■ 3. 

Let <f> be satisfiable in some model of FTq. Then we can apply the simplification rules to <j> and 
compute a normal form ifi such that <j> and ifi are equivalent in every model of FTq. Hence ifi is 
satisfiable in some model of FTq. Thus ifi ^ _L, which means that ifi is solved. Hence we know by 
the preceding proposition that ifi is satisfiable in every model of FT. Since <j> and ifi are equivalent in 
every model of FTqCFT, we have that <j> is satisfiable in every model of FT. § 

4 Entailment, Independence and Negation 

In this section we discuss some general properties of constraint entailment. This prepares the 
ground for the next section, which is concerned with entailment simplification in the feature 
tree constraint system. 

Throughout this section we assume that A is a structure, 7 and (j> are formulae that can be 
interpreted in A, and that X is a finite set of variables. 

We say that 7 disentails (f> in A if 7 entails -i^> in A. If 7 is satisfiable in A, then 7 cannot both 
entail and disentail 3X(f> in A. We say that 7 determines (f> in A if 7 either entails or disentails 
(j) in A. 

Given 7, ^» and X, we want to determine in an incremental manner whether 7 entails or 
disentails 3X(f>. Typically, 7 will not determine 3X(f> when 3X(f> is considered first, but this 
may change when 7 is strengthened to 7 A 7'. To this end, we use the concept of relative 
simplification of constraints first introduced in [6]. The basic idea leading to an incremental 
entailment checker is to simplify cj> with respect to (relatively to) the context 7 and the local 
variables X. Given 7, X and (f>, simplification must yield a formula tp such that: 

7 3X(j) ~ 3Xfp. 

The following facts provide some evidence that this is the appropriate invariant for entailment 
simplification. 

Proposition 5 Let 7 \= A 3X(j> ^ 3Xip. Then: 

1. 7 |=^ 3X(j> if and only ifj \= A 3Xip; 

2. 7 |=_4 -\3X(j) if and only ifj |=_4 -\3X%p; 

3. if i/j = _L, then 7 |=_4 -\3X(j); 

4. if 3Xip holds in in A then 7 \= A 3X(j>. 
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Statements 1 and 2 say that it does not matter whether entailment and disentailment are decided 
for (f> or tp. Statement 3 gives a local condition for disentailment, and Statement 4 gives a 
local condition for entailment. The entailment simplification system for feature trees given in 
the next section will in fact decide entailment and disentailment by simplifying such that the 
condition of Statement 4 is met in the case of entailment, and that the condition of Statement 
3 is met in the case of disentailment. 

In practice, one can ensure by variable renaming that no variable of X occurs in 7. The 
next fact says that then it suffices if entailment simplification respects the more convenient 
invariant: 

A\=jA(j)-^jAtp. 
This is the invariant respected by our system (cf. Proposition 8). 

Proposition 6 Let XnV(j) = 0. Then: 

1. ifA\=jA(j)-^jAtp, then 7 \=a 3X(j> <-> 3Xip; 

2. 7 |=^ -i3X</> if and only ifj A (f> is unsatisfiable in A. 

That is, the conjunction 7 A (f> is satisfiable if and only if 7 either entails 3X(f>, or it does not 
determine 3X(j>. 

The independence of negative constraints [9, 16, 17] is an important property of constraint 
systems. If it holds, simplification of conjunctions of positive and negative constraints can be 
reduced to entailment simplification of conjunctions of positive constraints. In order to see 
why, observe that 7 A -1^1 A . . . A </>„ is unsatisfiable if and only if 7 entails </>i V . . . V </>„. 

To define the independence property, we assume that a constraint system is a pair consisting 
of a structure A and a set of basic constraints. From basic constraints one can build more 
complex constraints using the connectives and quantifiers of predicate logic. We say that a 
constraint system satisfies the independence property if: 

7 \=a 3X\(j>i V ... V 3X n (j) n if and only if3i: 7 |=^ 3X,-</>,- 

for all basic constraints 7, , . . . , (j) n and all finite sets of variables X\,...,X n . 

Proposition 7 If a constraint system satisfies the independence property, then the following 
statements hold (7, (/> and . . . , (/)„ are basic constraints): 

1. 7 A -i3Xi^»i A ... A -i3X n (j) n unsatisfiable in A if and only if3i: 7 |=^ 3X,^»,-; 

2. if 7 A -iBXi^i A. . . A^3X n (j) n is satisfiable in A then 7 A -*3X\(j)\ A... A^3X n (j) n |=^ 3X(j) 
if and only ifj |=^ 3X(j>. 
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5 Entailment Simplification 

We will now use the general setting of the previous section for the specific case of feature tree 
constraints. Throughout this section we assume that 7 is a solved constraint and X is a finite 
set of variables not occurring in 7. We will call 7 the context, the variables in X local, and all 
other variables global. Relative simplification is always carried out with respect to the context. 

If T is a theory and (fr and ifr are possibly open formulae, we write (fr \=j tfr (read: (fr entails ifr 
in T) if V(<£ ifr) holds in T. 

The next theorem expresses the same observations stated before Theorem 3 regarding 
disentailment rather than satisfiability. 

Theorem 4 For every basic constraint (fr, the following equivalences hold: 

7 1=7- ->3X(j) if and only ifj \=ft q ~>3X(fr if and only ifj \=ft -i3X(/). 

Proof: Implication "2 =>■ 3" holds since FTq CFT. Implication "3 =>■ 1" holds since T is a model of 
FT. To show implication "1 =>• 2", suppose 7 \=r ->3X(j). Then we know by Proposition 6 that 7 A <f> 
is unsatisfiable in T. Thus we know by Theorem 3 that 7 A <j> is unsatisfiable in every model of FTq. 
Hence we know by Proposition 6 that 7 |= F7 - 0 -3X<f>. | 

For every basic constraint <j> and every variable x we define: 



A basic constraint (j> is X-oriented if jc = y G ^» always implies jc G X or y X. A basic 
constraint ^ is pivoted if x = y G (j) implies that x is eliminated in <j> (and then y is a "pivot"). 

The following entailment simplification rules simplify basic constraints to basic constraints 
with respect to a context 7 and local variables X. 




y if x = y G (j) andx is eliminated; 
x otherwise. 



1. 



xfu A (j) 



yfv G 7 A 0, <?!>y = x 



u = v A (j) 



2. 



(j)u = (j)v A (j) 



xfu A yfv C 7, 

^» X-oriented and pivoted 



3. — Ax A By C j A (j), fix = (fry, A ^ B 

4. — - — Ay G 7 A ^, (fry = x 
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6. 



7. 



x = y f\ (f) i x ^ y, x G V(</>), 

x = y A (j>[x <- y] \ {x G X or y £ X) 

x = y A (j) 
y = x A (f> 

<f>[x <- y] 

x = x A ^ 



x^Z, yGX 
x = yG7, xGV(</>) 



We say that a basic constraint (j) simplifies to a constraint (j) with respect to 7 and X if (j) = i/j or 
^» simplifies to tp in finitely many steps each licensed by one of the eight simplification rules 
given above. The notions of normal and normal form with respect to 7 are defined accordingly. 



Example 5.1 Assume, in the context of functions in LIFE [6] (the case of guarded Horn 
clauses [18] is quite similar), that a function/wn is defined in the form/M«(z, z) — > . . ., and 
that it is called asfun(x\f u : A], y\f v : B]). That is, the actual parameter pair of 
feature descriptions (x\f =4> u : A] , y\f =^ v : B] ) has to be tested upon matching of (and 
incompatibility with) the formal parameter pair (z, z). (This is in order to know whether 
that function call fires, fails, or residuates.) As shown in [24, 5], this corresponds to testing 
whether the context 7 = xfu A yfv A Aw A Bv entails the guard 3z (x = z A y = z). 

Let X = {z}. Then we have the following simplification chain with respect to 7 and X: 

x = z. A v = z 
by Rule E6 ^> 7 ,x z. = x A y = z 
by Rule E5 =^ 7 ,x z = x A y = x 
by Rule E2 ^> 7 ,x u = vAz = xAy = x 
by Rule E3 =^ 7 ,x -L 

Let us now take as context 7 = xfu A yfv A Au. Then ^ = w = vAz = xAv = xis 
normal with respect to 7 and X. We shall see that this normal form tells us that 7 does not 
determine (j). If 7 gets strengthened either to 7 A Bv (as above), or to 7 A x = y, then the 
strengthened context does determine: it disentails in the first and entails in the second case. 
The basic normal form of 7 A x = y is yfu A Au A v = u A x = y; with respect to this context 
^ simplifies to z = y. 



In the previous example, (j> = z = xAy = x simplifies to^»i = w = vAz = xAy = x with 
respect to 7 = xfu A yfv A Au A Bv and X = {z}. This corresponds to a basic simplification as 
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follows: 



7 



A <j> 



by Rule B4 
by Rule Bl 



xfu Ayfv A Aw A Bv 
xfu A xfv A Au A Bv 
xfv A Au A Bv 



A u 



A Z 



A z 



x Ay 



x Ay 



v A z 



x 



x 



x Ay = x 



7 



A $ 



We observe that 7 A is equal to 7' A </>[, modulo renaming y by <j>\y = x and u by ^>i« = v, 
and modulo the repetition of x/v. 

Lemma 1 Lef (j> simplify to (f>\ with respect to 7 and X, not using Rule E6 (in an entailment 
simplification step). Then 7 A <j> simplifies to some 7' A <j>\ which is equal to 7 A <j>\ up to 
variable renaming and repetition of conjuncts. 

Proof: Clearly, each entailment simplification rule, except for E6, corresponds directly to a basic 
simplification rule (namely, El and E2 to Bl, E3 to B2, E4 to B3, E5 and E7 to B4, and E8 to B5). 

If the application of the entailment simplification rule to $ relies on a condition of the form cj)x = y 
or cj>x = <f>y where x ^ <frx or y ^ <fry, then x = <frx G <fr or y = <fry £ <f>, and Rule B4 is first applied to 
7 A <f>, eliminating x by <j>x (y by ^y). 

When comparing 7 A <£i and 7' A <£'j, renamings take account of these variable eliminations. Note 
that, if the rule applied to $ is E2, then 7' has one feature constraint xfv less than 7 — which, after 
renaming, has a repetition of exactly this constraint. | 

Proposition 8 If(f> simplifies to ip with respect to 7 andX, then 7 A (f> and 7 A ip are equivalent 
in every model ofFT$. 

Proof: Follows from Lemma 1 and Proposition 1. | 
Proposition 9 The entailment simplification rules are terminating, provided 7 and X are fixed. 

Proof: First we strengthen the statement by weakening the applicability conditions <j>y = x in 
Rules El and E4 to <j>y = <j>x. Then from Lemma 1 follows: (*) Each entailment simplification rule 
applies to cj>\ with respect to 7 and X if and only if it applies to <p[ with respect to 7' and X — except 
possibly for E5, when the corresponding variable has already been eliminated in an "extra" basic 
simplification step. 

If 7' has one conjunct of the form xfu less than 7, then (*) still holds; regarding a new application 
of E2 this is ensured by its (therefore so complicated. . . ) applicability condition. 

With Condition (*), it is possible to prove by induction on n: For every entailment simplification 
chain <j>,<j>i, . . . ,<j>„ with respect to 7 and X, there exists a 'basic plus Rule E6' simplification chain 
^'n+k' where k > 0 is the number of "extra" variable elimination steps. Since, 
according to Proposition 2, basic simplification chains are finite, so are entailment simplification 
chains. | 
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So far we know that we can compute for any basic constraint <j> a normal form if) with respect 
to 7 and X by applying the simplification rules as long as they are applicable. Although the 
normal form ip may not be unique, we know that 7 A <j> and 7 A ip are equivalent in every 
model of FTq. 

Proposition 1 0 For every basic constraint (f> one can compute a normal form tp with respect 
to 7 and X. Every such normal form if) satisfies: 7 \=q- 3X(/> if and only ifj \=q- BXi/j, and 
7 \=ft BX(j> if and only ifj \=ft 3Xif>. 

Proof: Follows from Propositions 8, 9, 6 and 5. | 

In the following we will show that from the entailment normal form rp of (p with respect to 
7 it is easy to tell whether we have entailment, disentailment or neither. Moreover, the basic 
normal form of 7 A (p is exactly 7 A ip in the first case (and in the second, where 7 A _L = _L), 
and "almost" in the third case (cf. Lemma 1). 

Proposition 1 1 A basic constraint (p ^ _L is normal with respect to 7 and X if and only if the 
following conditions are satisfied: 

1. (p is solved, X-oriented, and contains no variable that is bound by 7; 

2. if (px = y andxfu G 7, then yfv G" (pfor every v; 

3. if (px = (py andxfu G 7 and yfv G 7, then (pu = (pv; 

4. if (j>x = y and Ax G 7, then By G" <p for every B; 

5. if (j)x = (j)y and Ax G 7 and By G 7, then A = B. 

Lemma 2 If (j> ^ _L is normal with respect to 7 andX, then 7 A (f> is satisfiable in every model 
of FT. 

Proof: Let cj> ^ _L be normal with respect to 7 and X. Furthermore, let j = j N Aj c and cj> = <p N A<pa 
be the unique decompositions into normalizer and graph. Since the variables bound by 7^ occur 
neither in jc nor in <A> it suffices to show that jc A 4>n A <t>c is satisfiable in every model of FT. 

Let <Pn{7c) be the basic constraint that is obtained from j c by applying all bindings of <f> N . Then 
j c A<p N A (j> G is equivalent to 4>n A <Pn{jg) A <frc an d no variable bound by <fr N occurs in 4>n{ic) A 4>g- 
Hence it suffices to show that <Pn{7c) A 4>g is satisfiable in every model of FT. With Conditions 2-5 
of the preceding proposition it is easy to see that 4>n{ig) A ^ G is a solved clause. Hence we know by 
axiom scheme Ax3 that ^n{ig) A fie is satisfiable in every model of FT. | 

The following theorem states that relative simplification yields a decision procedure for 
disentailment of basic constraints. 
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Theorem 5 (Disentailment) Let if; be a normal form of <j> with respect to 7 and X. Then 
7 |=r -i3X</) if and only ifip = _L. 

Proof: Suppose -0 = _L. Then 7 \=t -3Xip and hence 7 |=r -i3X^ by Proposition 10. 

To show the other direction, suppose 7 \=r -3X<f>. Then 7 |=r -i3Xi/) by Proposition 10 and hence 
j Aip unsatisfiable in T by Proposition 6. Since T is a model of FT (Theorem 1), we know by the 
preceding lemma that ip = _L (since ip is assumed to be normal). | 

We say that a variable x is dependent in a solved constraint (f> if (f> contains a constraint of 
the form Ax, xfy or x = y. (Recall that equations are ordered; thus y is not dependent in the 
constraint x = y.) We use W(fa) to denote the set of all variables that are dependent in a 
solved constraint cj>. 

In the following we will assume that the underlying signature 5ttlf has at least one sort and 
at least one feature that does not occur in the constraints under consideration. This assumption 
is certainly satisfied if the signature has infinitely many sorts and infinitely many features. 

Lemma 3 Let (j>i, . . . , fabe basic constraints different from _L, and X\ , . . . , X n be finite sets of 
variables disjoint from V{~f). Moreover, for every i = 1 , . . . , n, let fa be normal with respect to 
7 andXi, and let fa have a dependent variable that is not in Xj. Then 7 A -i3X\fa A ... A ->3X n fa 
is satisfiable in every model of FT. 

Proof: Let 7 = 7^ A j c be the unique decomposition of 7 into normalizer and graph. Since the 
variables bound by 7^ occur neither in j c nor in any fa, it suffices to show that j G A -iSXifa A ... A 
-i3X n (j> n is satisfiable in every model of FT. Thus it suffices to exhibit a solved clause 6 such that 
7c C 6 and, for every i = 1, . . . , n, V(6) is disjoint with X, and 6 A is unsatisfiable in every model 
of/T. 

Without loss of generality we can assume that every X t is disjoint with V(7) and V(fa) — Xj for all j. 
Hence we can pick in every a dependent variable x, such that x, ^ Xj for any j. 

Let z\,...,Zk be all variables that occur on either side of equation x,- = y £ <p if i = 1 , . . . , n (recall 
that Xi is fixed for ;')■ None of these variables occurs in any Xj since every is X, -oriented. Next we 
fix a feature g and a sort B such that neither occurs in 7 or any . 

Now 6 is obtained from 7 by adding constraints as follows: if Ax, £ (/>,, then add Bx, ; if xrfy £ <j>i, 
then addX(/"f; to enforce that the variables z\, . . . , Zk are pairwise distinct, add: 

ZkgZk-l A ... A zigZ\ A z\g] . 
It is straightforward to verify that these additions to 7 yield a solved clause 6 as required. | 

Proposition 1 2 //</> is solved and W{fa) C X, then FT |= \i3Xfa 

Proof: Let <j> = <pff A <t>c be the decomposition of <j> in normalizer and graph. Since every variable 
bound by <j> is in X, it suffices to show that VBX^c is a consequence of FT. This follows immediately 
from axiom scheme Ax3 since <fr c is a solved clause. | 
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The following theorem states that relative simplification yields a decision procedure for 
entailment of basic constraints. 

Theorem 6 (Entailment) Let if; be a normal form of (f> with respect to 7 and X. Then 
7 |=r 3X^» if and only if ip ^ _L and W(ip) C X. 

Proof: Suppose 7 \=t 3X<j>. Then we know 7 \=t 3Xifi by Proposition 10, and thus 7 A -<3Xifi is 
unsatisfiable in T. Since 7 is solved, we know that 7 is satisfiable in T and hence that 7 A 3X0 is 
satisfiable in T. Thus ifi ^ _L. Since 7 A -i3X0 is unsatisfiable in T and T is a model of FT, we 
know by Lemma 3 that VV{ip) C X. 

To show the other direction, suppose ip ^ -L and VV(ip) C X Then fT |= V3X0 by Proposition 12, 
and hence T \= V3Xip. Thus 7 \=r 3Xip, and hence 7 \=r 3Xcj> by Proposition 10. | 

The next theorem shows that it does not matter whether entailment of basic constraints is 
interpreted in the algebraic semantics (i.e., in the feature tree structure T) or in the logical 
semantics (given by the axioms of FT). Now, of course, the third axiom is necessary (take, for 
example, as context the true constraint T). 

Theorem 7 Let (j>be a basic constraint. Then 7 \=q- 3X(j> if and only ifj \=ft 3X^>. 

Proof: One direction holds since T is a model of FT. To show the other direction, suppose 
7 \=t 3X<f>. Without loss of generality we can assume that <j> is normal with respect to 7 and X. 
Hence we know by Theorem 6 that <j> ^ _L and VV(ij)) C X. Thus FT \= V3X<^> by Proposition 12 
and hence 7 \= FT 3Xcj). | 

We finally show that our constraint system enjoys the property which allows one to solve 
conjunctions of negative constraints through relative simplification. 

Theorem 8 (Independence) Let fa, . . . ,(/>„ be basic constraints, and X\, . . . ,X n be finite 
sets of variables. Then: 

7 |=r 3Zi^»i V ... V 3X n (j) n if and only if3i: 7 |=r 3X,-^>,-. 

Proof: To show the nontrivial direction, suppose 7 \=r 3X\(j>\ V ... V 3X n (j>„. Without loss of 
generality we can assume that, for all i = 1 , . . . , n, Xi is disjoint from V(7), 4>i i s normal with respect 
to 7 andX,-, and <frj ^ _L. Since 7 A -iBX^i A ... A -i3X„<fr„ is unsatisfiable in T and T is a model of 
FT, we know by Lemma 3 that VV(cj)k) Q X& for some k. Hence 7 \=r 3X^k by Theorem 6. | 

6 Conclusion 

We have presented a constraint system FT for logic programming providing a universal data 
structure based on rational feature trees. FT accommodates record-like descriptions. We think 
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that these are superior to the constructor-based descriptions of Herbrand in that they allow 
expressing partial knowledge in a more flexible way. 

The declarative semantics of FT is specified both algebraically (the feature tree structure T) 
and logically (the first-order theory FT given by three axiom schemes). 

The operational semantics for FT is given by an incremental constraint simplification system, 
which can check satisfiability of and entailment between constraints. Since FT satisfies the 
independence property, the simplification system can also check satisfiability of conjunctions 
of positive and negative constraints. 

We see four directions for further research. 

First, FT should be strengthened such that it subsumes the expressivity of rational constructor 
trees [9, 10]. As is, FT cannot express that x is a tree having direct subtrees at exactly the 
features /i, . . . ,/„. It turns out that the system CFT [26] obtained from FT by adding the 
primitive constraint: 

x{fi, ...,/„} 

(x has direct subtrees at exactly the features /i, . . . ,/„) has the same nice properties as FT. In 
contrast to FT, CFT can express constructor constraints; for instance, the constructor constraint 
x = A(y, z) can be expressed equivalently as Ax A x{ 1 , 2} A xly A x2z, if we assume that A is a 
sort and the numbers 1 , 2 are features. 

Second, it seems attractive to extend FT such that it can accommodate a sort lattice as used 
in [1, 3, 4, 5, 25]. One possibility to do this is to assume a partial order < on sorts and replace 
sort constraints Ax with quasi-sort constraints [A]x whose declarative semantics is given as: 

[A]x = \J Bx. 

B<A 

Given the assumption that the sort ordering < has greatest lower bounds if lower bounds exist, 
it seems that the results and the simplification system given for FT carry over with minor 
changes. 

Third, the worst-case complexity of entailment of basic constraints checking in FT should be 
established. We conjecture it to be quasi-linear in the size of 7 and (j>, provided the available 
features (finitely many) are fixed a priori. 

Lastly, implementation techniques for FT at the level of the Warren abstract machine [2] need 
to be developed. 
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